RAW SEQUENCE LISTING DATE: 11/26/2002
PATENT APPLICATION: US/09/989,994 TIME: 13:53:00

Input Set : A:\seqlist.txt 

Output Set: N:\CRF4\11262002\l989994.raw 

<110> APPLICANT: LIU, Qiang 

<120> TITLE OF INVENTION: POSITION DEPENDENT RECOGNITION OF GNN NUCLEOTIDE
      TRIPLETS BY ZINC FINGERS
<130> FILE REFERENCE: 8325-0011.20 / S11-US2 
<140> CURRENT APPLICATION NUMBER: 09/989,994 
<141> CURRENT FILING DATE: 2001-11-20 
<160> NUMBER OF SEQ ID NOS: 4085
<170> SOFTWARE: PatentIn Ver. 2.0
<210> SEQ ID NO: 1
<211> LENGTH: 25
<212> TYPE: PRT
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: exemplary motif
<220> FEATURE:
<221> NAME/KEY: MOD_RES
<222> LOCATION: (2)..(5)
<223> OTHER INFORMATION: this region may encompass two to four residues
      consisting of any amino acid
<220> FEATURE:
<221> NAME/KEY: MOD_RES
<222> LOCATION: (7)..(18)
<223> OTHER INFORMATION: any amino acid
<220> FEATURE:
<221> NAME/KEY: MOD_RES
<222> LOCATION: (20)..(24)
<223> OTHER INFORMATION: this region may encompass three to five residues
      consisting of any amino acid
<400> SEQUENCE: 1
Cys Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1               5                   10                  15

Xaa Xaa His Xaa Xaa Xaa Xaa Xaa His
            20                  25
<210> SEQ ID NO: 2
<211> LENGTH: 5
<212> TYPE: PRT
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: peptide linker
<400> SEQUENCE: 2
Thr Gly Glu Lys Pro
1               5
<210> SEQ ID NO: 3
<211> LENGTH: 5
<212> TYPE: PRT
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: peptide linker
<400> SEQUENCE: 3
Gly Gly Gly Gly Ser
1               5

<210> SEQ ID NO: 4
<211> LENGTH: 8
<212> TYPE: PRT
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: peptide linker
<400> SEQUENCE: 4
Gly Gly Arg Arg Gly Gly Gly Ser
1               5

<210> SEQ ID NO: 5
<211> LENGTH: 9
<212> TYPE: PRT
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: peptide linker
<400> SEQUENCE: 5
Leu Arg Gln Arg Asp Gly Glu Arg Pro
1               5

<210> SEQ ID NO: 6
<211> LENGTH: 12
<212> TYPE: PRT
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: peptide linker
<400> SEQUENCE: 6
Leu Arg Gln Lys Asp Gly Gly Gly Ser Glu Arg Pro
1               5                   10

<210> SEQ ID NO: 7
<211> LENGTH: 16
<212> TYPE: PRT
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: peptide linker
<400> SEQUENCE: 7
Leu Arg Gln Lys Asp Gly Gly Gly Ser Gly Gly Gly Ser Glu Arg Pro
1               5                   10                  15

<210> SEQ ID NO: 8
<211> LENGTH: 25
<212> TYPE: PRT
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: exemplary motif
<220> FEATURE:
<221> NAME/KEY: MOD_RES
<222> LOCATION: (2)..(5)
<223> OTHER INFORMATION: this region may encompass two to four residues
      consisting of any amino acid
<220> FEATURE:
<221> NAME/KEY: MOD_RES
<222> LOCATION: (7)..(18)
<223> OTHER INFORMATION: any amino acid
<220> FEATURE:
<221> NAME/KEY: MOD_RES
<222> LOCATION: (20)..(24)
<223> OTHER INFORMATION: this region may encompass three to five residues
      consisting of any amino acid
<400> SEQUENCE: 8
Cys Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1               5                   10                  15

Xaa Xaa His Xaa Xaa Xaa Xaa Xaa His
            20                  25

<210> SEQ ID NO: 9
<211> LENGTH: 30
<212> TYPE: PRT
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: mouse transcription
      factor Zif268
<400> SEQUENCE: 9
Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg Arg Phe Ser Arg Ser Asp
1               5                   10                  15

Glu Leu Thr Arg His Ile Arg Ile His Thr Gly Gln Lys Pro
            20                  25                  30

<210> SEQ ID NO: 10
<211> LENGTH: 28
<212> TYPE: PRT
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: mouse transcription
      factor Zif268
<400> SEQUENCE: 10
Phe Gln Cys Arg Ile Cys Met Arg Asn Phe Ser Arg Ser Asp His Leu
1               5                   10                  15

Thr Thr His Ile Arg Thr His Thr Gly Glu Lys Pro
            20                  25

<210> SEQ ID NO: 11
<211> LENGTH: 27
<212> TYPE: PRT
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: mouse transcription
      factor Zif268
<400> SEQUENCE: 11
Phe Ala Cys Asp Ile Cys Gly Arg Lys Phe Ala Arg Ser Asp Glu Arg
1               5                   10                  15

Lys Arg His Thr Lys Ile His Leu Arg Gln Lys
            20                  25

<210> SEQ ID NO: 12
<211> LENGTH: 9
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: target DNA
<400> SEQUENCE: 12
gcgtgggcg 9

<210> SEQ ID NO: 13
<211> LENGTH: 94
<212> TYPE: PRT
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: Sp-1
      transcription factor
<400> SEQUENCE: 13
Pro Gly Lys Lys Lys Gln His Ile Cys His Ile Gln Gly Cys Gly Lys
1               5                   10                  15

Val Tyr Gly Lys Thr Ser His Leu Arg Ala His Leu Arg Trp His Thr
            20                  25                  30

Gly Glu Arg Pro Phe Met Cys Thr Trp Ser Tyr Cys Gly Lys Arg Phe
        35                  40                  45

Thr Arg Ser Asp Glu Leu Gln Arg His Lys Arg Thr His Thr Gly Glu
    50                  55                  60

        Phe Ala Cys Pro Glu Cys Pro Lys Arg Phe Met Arg Ser Asp
65                  70                  75                  80

His Leu Ser Lys His Ile Lys Thr His Gln Asn Lys Lys Gly
                85                  90

<210> SEQ ID NO: 14
<211> LENGTH: 9
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: target DNA
<400> SEQUENCE: 14
ggggcgggg 9

<210> SEQ ID NO: 15
<211> LENGTH: 100
<212> TYPE: PRT
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: Sp-1 consensus
<400> SEQUENCE: 15
Met Glu Lys Leu Arg Asn Gly Ser Gly Asp Pro Gly Lys Lys Lys Gln
1               5                   10                  15

His Ala Cys Pro Glu Cys Gly Lys Ser Phe Ser Lys Ser Ser His Leu
            20                  25                  30

Arg Ala His Gln Arg Thr His Thr Gly Glu Arg Pro Tyr Lys Cys Pro
        35                  40                  45

Glu Cys Gly Lys Ser Phe Ser Arg Ser Asp Glu Leu Gln Arg His Gln
    50                  55                  60

Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Pro Glu Cys Gly Lys
65                  70                  75                  80

Ser Phe Ser Arg Ser Asp His Leu Ser Lys His Gln Arg Thr His Gln
                85                  90                  95

Asn Lys Lys Gly
            100

<210> SEQ ID NO: 16
<211> LENGTH: 9
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: Sp-1 consensus
      sequence
<400> SEQUENCE: 16
ggggcgggg 9

<210> SEQ ID NO: 17
<211> LENGTH: 9
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: example target
      DNA
<400> SEQUENCE: 17
gcgggggcg 9

<210> SEQ ID NO: 18
<211> LENGTH: 9
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: example target
      DNA
<400> SEQUENCE: 18
gcgggggcg 9

<210> SEQ ID NO: 19
<211> LENGTH: 9
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: example target
DATE: 11/26/2002
TIME: 13:53:00



Please Note

Use of n and/or Xaa has been detected in the Sequence Listing. Please review the
Sequence Listing to ensure that a corresponding explanation is presented in the <220>
to <223> fields of each sequence that presents at least one n or Xaa.
Pos. 2,5,6
Seq#:4025; Xaa Pos. 2,5,6
Seq#:4026; Xaa Pos.
Pos. 2,5,6
Seq#:4034; Xaa Pos.
VERIFICATION SUMMARY DATE: 11/26/2002

PATENT APPLICATION: US/09/989,994 TIME: 13:53:00 

Input Set : A:\seqlist.txt 

Output Set: N:\CRF4\11262002\I989994.raw 

L:53292 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4057 after pos.:0 
L:53316 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4058 after pos.:0 